Micro-blog clustering and topic word extraction based on hashtag and forwarding relationship
SHU Jue, CHENG Weiqing, DENG Cong
Journal of Computer Applications    2016, 36 (2): 460-464.   DOI: 10.11772/j.issn.1001-9081.2016.02.0460
To address the low accuracy of micro-blog clustering, this work studies micro-blog data and uses hashtags to enhance the vector space model and forwarding relationships to improve clustering accuracy. Topic keywords for each cluster are then extracted using information such as a micro-blog's forwarding count and comment count, together with information about the user who posted it. Experiments on a Sina micro-blog dataset show that, compared with the k-means algorithm and ICST-WSNB (a short Chinese text incremental clustering algorithm based on weighted semantics and Naive Bayes), the proposed clustering method based on hashtags and forwarding relationships increases accuracy by 18.5% and 6.63% respectively; recall and F-value also improve. The results indicate that the proposed method effectively improves the accuracy of micro-blog clustering and thereby yields more appropriate topic words.
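The abstract does not give the formulas, so the following Python sketch only illustrates the general idea under stated assumptions: hashtag terms get extra weight in the term vector, a forwarding link between two posts raises their similarity, and topic words are ranked by term weight scaled by forwarding and comment counts. The `#tag#` hashtag syntax, the constants `HASHTAG_BOOST` and `forward_bonus`, the post fields, and all function names are hypothetical choices for illustration, not the authors' method.

```python
import math
import re
from collections import Counter

HASHTAG_BOOST = 2.0  # assumed extra weight for hashtag terms, not from the paper


def vectorize(text, boost=HASHTAG_BOOST):
    """Term-frequency vector in which terms inside #tag# get extra weight."""
    vec = Counter(re.findall(r"\w+", text.lower()))
    for tag in re.findall(r"#([^#]+)#", text):
        for term in re.findall(r"\w+", tag.lower()):
            vec[term] += boost
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity(p, q, forward_bonus=0.5):
    """Text similarity, raised (capped at 1.0) when one post forwards the other."""
    sim = cosine(vectorize(p["text"]), vectorize(q["text"]))
    if p.get("forward_of") == q["id"] or q.get("forward_of") == p["id"]:
        sim = min(1.0, sim + forward_bonus)  # assumed way to combine the two signals
    return sim


def topic_words(cluster, top_n=3):
    """Rank terms by weight, scaled by each post's forwarding and comment counts."""
    scores = Counter()
    for post in cluster:
        influence = 1 + post["forwards"] + post["comments"]  # assumed influence score
        for term, weight in vectorize(post["text"]).items():
            scores[term] += influence * weight
    return [term for term, _ in scores.most_common(top_n)]
```

A pairwise similarity of this kind could feed any similarity-based clustering step. User information, which the abstract also mentions for keyword extraction, is left out of this sketch because the abstract does not say how it is used.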